首页> 外文OA文献 >GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications

【2h】

GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications

机译：GRE：用于大规模分布式图并行的图运行引擎应用

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large-scale distributed graph-parallel computing is challenging. On one hand,due to the irregular computation pattern and lack of locality, it is hard toexpress parallelism efficiently. On the other hand, due to the scale-freenature, real-world graphs are hard to partition in balance with low cut. Toaddress these challenges, several graph-parallel frameworks including Pregeland GraphLab (PowerGraph) have been developed recently. In this paper, wepresent an alternative framework, Graph Runtime Engine (GRE). While retainingthe vertex-centric programming model, GRE proposes two new abstractions: 1) aScatter-Combine computation model based on active message to exploit massivefined-grained edge-level parallelism, and 2) a Agent-Graph data model based onvertex factorization to partition and represent directed graphs. GRE isimplemented on commercial off-the-shelf multi-core cluster. We experimentallyevaluate GRE with three benchmark programs (PageRank, Single Source ShortestPath and Connected Components) on real-world and synthetic graphs of millionsbillion of vertices. Compared to PowerGraph, GRE shows 2.5~17 times betterperformance on 8~16 machines (192 cores). Specifically, the PageRank in GRE isthe fastest when comparing to counterparts of other frameworks (PowerGraph,Spark,Twister) reported in public literatures. Besides, GRE significantlyoptimizes memory usage so that it can process a large graph of 1 billionvertices and 17 billion edges on our cluster with totally 768GB memory, whilePowerGraph can only process less than half of this graph scale.

机译：大规模分布式图并行计算具有挑战性。一方面，由于计算模式不规则，缺乏局部性，很难有效地表达并行性。另一方面，由于比例尺的自由性，现实世界的图形很难通过低切来平衡分配。为了应对这些挑战，最近开发了包括Pregeland GraphLab（PowerGraph）在内的几种图形并行框架。在本文中，我们提出了一个替代框架Graph Runtime Engine（GRE）。在保留以顶点为中心的编程模型的同时，GRE提出了两个新的抽象概念：1）基于活动消息的散点合并计算模型，以利用大规模细粒度的边缘级并行性； 2）基于顶点因数分解的Agent-Graph数据模型进行分区和划分。表示有向图。 GRE在现成的商用多核群集上实现。我们在现实世界和数亿亿个顶点的合成图上，使用三个基准程序（PageRank，Single Source ShortestPath和Connected Components）对GRE进行了实验评估。与PowerGraph相比，GRE在8〜16台计算机（192个内核）上的性能提高了2.5〜17倍。特别是，与公开文献中报道的其他框架（PowerGraph，Spark，Twister）相比，GRE中的PageRank最快。此外，GRE显着优化了内存使用，因此它可以处理群集中拥有10亿个顶点和170亿条边的大型图，总共有768GB内存，而PowerGraph只能处理不到此图比例的一半。

著录项

作者
Yan, Jie; Tan, Guangming; Sun, Ninghui;
展开▼
作者单位

展开▼
年度 2013
总页数
原文格式 PDF
正文语种 {"code":"en","name":"English","id":9}
中图分类

相似文献

外文文献
中文文献
专利

1. Distributed computing practice for large-scale science and engineering applications [J] . Shantenu Jha, Murray Cole, Daniel S. Katz, Concurrency and Computation . 2013,第11期

机译：面向大规模科学和工程应用的分布式计算实践
2. A DISTRIBUTED SDP APPROACH FOR LARGE-SCALE NOISYANCHOR-FREE GRAPH REALIZATION WITH APPLICATIONS TOMOLECULAR CONFORMATION [J] . PRATIK BISWAS, KIM-CIIUAN TOH, YINYU YE SIAM Journal on Scientific Computing . 2009,第3期

机译：大规模Noisyanchor无图实现的分布式SDP方法及其分子构造
3. A DISTRIBUTED SDP APPROACH FOR LARGE-SCALE NOISYANCHOR-FREE GRAPH REALIZATION WITH APPLICATIONS TOMOLECULAR CONFORMATION [J] . PRATIK BISWAS, KIM-CIIUAN TOH, YINYU YE SIAM Journal on Scientific Computing . 2009,第3期

机译：大规模Noisyanchor无图实现的分布式SDP方法及其分子构造
4. Importance of Runtime Considerations in Performance Engineering of Large-Scale Distributed Graph Algorithms [C] . Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Workshop on big data management in clouds;Euro-Par 2015 International workshops;Workshop on parallel and distributed computing education for undergraduate students;Workshop on algorithms, models, and tools for parallel computing on heterogeneous platforms;Workshop on large-scale distributed virtual environments;Workshop on on-chip memory hierarchies and interconnects: organization, management and implementation;Workshop on parallel distributed agent-based simulations;Workshop on performance engineering for large-scale graph analytics;Workshop on reproducibility in parallel computing;Workshop on resiliency in high-performance computing with clouds, grids, and clusters;Workshop on runtime and operating systems for the many-core era;Workshop on unconventional high performance computing;Workshop on virtualization in high-performance cloud computing . 2015

机译：大规模分布图算法性能工程中运行时注意事项的重要性
5. Application of Sustainability Principles for Green Engineering: Production of High-Value Platform Chemicals from Selective Biomass Fast Pyrolysis and Use of Safer Solvents for Liquid Chromatography [D] . Nallar, Melisa. 2020

机译：可持续性原理在绿色工程中的应用：从选择性生物量快速热解的高价值平台化学品的生产和使用更安全溶剂的液相色谱法
6. A Review of Distributed Optical Fiber Sensors for Civil Engineering Applications [O] . António Barrias, Joan R. Casas, Sergi Villalba 2016

机译：土木工程应用分布式光纤传感器的综述
7. PARM: Physics Aware Runtime Manager for Large-scale Scientific and Engineering Applications [O] . Yeliang Zhang, Salim Hariri, Jianwei Xiang, 2015

机译：paRm：物理意识运行时管理器，适用于大规模科学和工程应用
8. Parallelizing Molecular Dynamics Programs for Distributed Memory Machines: An Application of the CHAOS Runtime Support Library. [R] . Hwang, Y., Das, R., Saltz, J., 1994

机译：分布式存储器机器的分子动力学程序的并行化：CHaOs运行时支持库的应用。

GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications

摘要

著录项

相似文献

相关主题

期刊订阅